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Simple construction of 
quantum universal variable-length source coding 
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QQ ■ We simply construct a quantum universal variable-length source code in which, 

. independent of information source, both of the average error and the probabihty 
that the coding rate is greater than the entropy rate H{'pp), tend to 0. If H{'pp) 

^ ■ is estimated, we can compress the coding rate to the admissible rate H{'pp) with 

Tij" , a probability close to 1. However, when we perform a naive measurement for the 

I estimation of //(p^), the input state is demolished. By smearing the measure- 

' ment, we successfully treat the trade-off between the estimation of H(j)p) and the 

^) ■ non-demolition of the input state. Our protocol can be used not only for the Schu- 

' macher's scheme but also for the compression of entangled states. 

^: 

1 Introduction 

^ : 

^ . When we compress our data in a classical computer, we usually use not a fixed-length 
code, but a variable-length source code like gzip. In the quantum case, according to 
^ I Schumacher's resultsjll], when our quantum data obeys independent identical distribution 
i (i.i.d.) of a probability p of quantum states, we can compress our data up to the entropy 
■ rate of the average density operator defined as the mixture of the probability p. However, 
. . his protocol is not applicable to the case where we do not know the average density 
operator, which the construction of the protocol is dependent on. Using representation 
theory of unitary group, Jozsa and Horodecki family|0] constructed a quantum universal 
fixed-length code, and it is efficient in the i.i.d. case when the entropy rate of the source is 
less than the rate of the code. Otherwise, this protocol demolish the state unrecoverablly. 
(The optimality of their code among quantum fixed-length codes is proven not only in 
the sense of the compression rate, but also in the sense of error exponent by Hayashi |^.) 

Hence, a quantum universal variable-length code, that does not depend on the rate is 
more desired. Of course, in such a code, the coding rate must not be determined a priori 
and it must be decided from the input state. While this decision does not change the 
source in the classical case, it does cause the destruction in the quantum case because 
this decision requires a quantum measurement. Therefore, we treat the trade-off between 
the compression rate and the non-demolition. While this type code was thought to be 
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impossible by some researchers^, it was constructed by Hayashi and MatsumotoQ as 
the following strategy. 

First, we consider the optimal measurement for the decision of the coding length (the 
estimation of the entropy rate) in the sense of the large deviation (also optimal in the 
sense of mean square error, in many cases). For such an optimal measurement demolishes 
the state, an unsharp measurement, generated by smearing out the optimal measurement, 
is considered. Such an unsharp one is also optimal in the sense of the large deviation while 
it is not optimal in the sense of the mean square error any more. The previous paper ||^ 
used such a smeared measurement, but the smearing process is too complicated to give 
a clear insight to the essence of the protocol. In this paper, by constructing a quantum 
variable-length code from a quantum universal fixed-length code, we clarify the trade-off 
between the compression rate and the non-demolition. 

To construct optimal code in the sense of overflow exponent, we make heavy use of 
the group representation type theory, which is not necessary to achieve the optimal com- 
pression rate. For example, one can replace our group representation theoretic estimation 
of the entropy of the average state by the one based on a tomographic estimate of the 
state. In this case, evaluation of the protocol will be done by usual type theory (no room 
for group representation!). 

Based on the idea of smeared measurement of Hayashi and Matsumoto [^] , Presnell and 
Jozsa[^ [7| developed the following strategy. We decide the coding length for every com- 
putational basis and compress the input state by using a classical compression algorithm 
under the computational basis having the minimum coding length. Since the demolition 
of the state is unavoidable in their approach, they applied the above smearing method to 
their approach, and successfully constructed a quantum universal variable-length source 
code. Their code achieves optimal compression rate not only for the i.i.d. case but also for 
other cases. For example, when states generated from the source is orthogonal with each 
other, their method is useful though the probability is not i.i.d. but Markov. However, 
their method has the following drawbacks. While this paper and the paper optimize 
the decreasing exponent of the overflow probability, their paper did not deal with this 
type optimization. 

The essential point of this paper is the construction of a quantum universal variable- 
length code from a quantum universal fixed-length code which satisfies the large deviation 
principle. In the present protocol, when quantum information sources generate non- 
orthogonal states with a classical Markov chain probability, it seems difficult to achieve 
the optimal compression rate. (Of course it seems difficult also in Presnell and Jozsa's 
protocol.) However, if we successfully construct a quantum universal fixed- length code 
for this case which satisfies the large deviation principle, we seem able to construct a 
quantum universal variable-length code for this case by modifying the quantum universal 
fixed-length code. 

This paper is organized as follows. First, we review quantum fixed-length source code 
which contains quantum universal fixed-length source code. Second, we give a precisely 
definition of quantum universal variable-length source code. Next, we construct a more 
simple code from a quantum universal fixed-length code. Finally, we discuss a application 
of our protocol to a compression of entangled states. 



2 Review of quantum fixed-length source coding 



Let 7i be a finite-dimensional Hilbert space that represents the physical system of interest 
and let SilH.) be the set of density operators on Ti. Consider a source which produces 
the state Pn '■= Pi ® P2 ® ■ ■ ■ ® Pn with the i.i.d. distribution of the probability p on 
states. In fixed-length source coding, a sequence of states pn is compressed to the state in 
a smaller Hilbert space Tin C Ti*^", whose dimension is e^^. Here, the encoder and the 
decoder is a trace-preserving completely positive (TP-CP) map and D^, respectively. 
The average of the total error is given by 



where Bures' distance is defined as b{p,a) := y 1 — Tr |y^^/o"|- ^^is setting, we focus 
on the infimum of the rate with which the average error goes to zero. The infimum is 
called the minimum admissible rate Rp of p, and is defined by 



Rp := ini { lim sup — log dim Hn 

n 



3{(7^„,E-,D")}, 



As was proven by Schumacher [Q, and Jozsa and Schumacher [Q, and Barnum et al. 0, 
when every pi is pure, the equation Rp = H(j)p) := — Trp^logpp holds, where Pp := 
'^pGS{H)P(p)P- Moreover Jozsa et al. constructed the projections Pn^n for a arbitrary 
rate R such that 

- log rank P^,„ R, Tr Pfi,„p^" ^1, (1) 
n 

for any density matrix p satisfying H{p) < R. Using the above projections, they proposed 
a quantum universal fixed-length source code depending only on the entropy rate as follows. 
The encoder E"' is defined by 

E^ipl) := Pfi,„p-;Pfi,„ + (Tr(/ - Pfi,„)Pn) |0)(0|, 

and the decoder is defined as the embedding. 

Hayashi more precisely evaluated the the performance of their code as follows0. We 
can chose a projection Pr „ such that 

rank Pfi,„ <{n+ if^n + df'^e''^ <{n + (2) 
1 - Tr Pfi,„p®" <{n + rf)^'^exp ( -n min D(b||a) ) , (3) 

V H{h)>R J 

where a denotes the probability distribution consisting of the eigenvalues of p^ and b 
denotes another probability distribution. Therefore, the average of the total error is 
evaluated by 

e{El,Dl) < 2(n + d)4'^exp (^-n ^min^ Z}(b||a)^ , (4) 

which goes to when R < Hij)^). The inequality @) was proven in the pure state case 
in Hayashi H, but as is proven in Appendix, it holds in the mixed state case. This type 



evaluation was essentially done by Keyl and Werner 1 10 



3 Quantum universal variable- length source coding 



In the classical system, depending on the input state, the encoder can determine the 
coding length. Such a code is called a variable-length code. Using this type of code, we 
can compress any information without error. Lynch and Davisson |]12| proposed a 



variable-length code with no error, in which the coding rate is less than H(p) except for 
a small enough probability under the distribution p. Such a code is called a universal 
variable-length source code. Today, their code can be regarded as the following two-stage 
code: at the first step, we send the empirical distribution which indicates a subset of data, 
and in the second step, we send information which indicates every sequence belonging to 
the subset. 

This paper deals with quantum data compression in which the encoder determines 
the coding length, according to the input state. In order to make this decision, he must 
measure the input quantum system. As is known, any quantum measurement is described 
by POVM M = {M^jj^gQ. When the data set Q is discrete, we may describe the state 
evolution of a quantum measurement M as follows while we need an instrument, i.e., CP- 
map valued measure in the general case[]T3. When we perform a quantum measurement 



M = {M^j^gQ to the quantum system whose state is a density operator p, we obtain the 
data u ^ Q with the probability Tr pM^ and the final state is y/M^py/MZ/ Tt pM^. An 
encoding process after the measurement is described by a TP-CP map E^^. Therefore, any 
encoder is given by E^(/9) := E^{\fM^p\fM^^ (E = {E^j^g^). The decoder is given 
by a set of TP-CP maps D = {D^^j^^gn, which presents the decoding process depending 
on the data uj. A pair of an encoder E = {E^j^gQ and a decoder D = {D^^l^^gQ is 
called a quantum variable-length source code on 7i. The coding length is described by 
log |f2|-|-log dimTCoj, which is a random variable obeying the probability Pp (cu) := Tr Ej^(p) 
when the input state is p. 

When the state p„ on Ti®" obeys the i.i.d. distribution of the probability p on 
states, the error of decoding for a variable- length code (E", D") on TC^^ is evaluated by 
Bures' distance as 



I:t.e:,,.,^(,„,o:„(^)) 



and the average error is given by 

6„,p(E^D'^) := p^(pr^) E TrEi(p.)6^ U,^:^ ( ^kn7n )) ■ 

In this case, the data uj^ obeys the probability: 

Pn,;K) ■■= E P^^M TrEKp-n) = Tr EIJJp^®'^). 

A sequence {(E",D"')} of quantum variable-length source code is called universal if 
e„,p(E"', D") — > for any probability p on states. 

As mentioned latter, there exists a quantum universal variable-length source code 
{(E",D")} satisfying 

hmP^; 1^ (log + logdimT^^J > H{-p^) + e| = 



A 



for any e > 0. Conversely, if a quantum variable-length source code {(E",D")} is uni- 
versal and 

limPfp |i (log IQnl + logdimT^^J > i?| = 0, 

then R > Rp = H{'Pp). Moreover, concerning the exponent of the overflow probability 
P^p (log l^nl + log dim 7ia;„ ) > R} the following theorems hold0. 

Theorem 1 There exists a quantum universal variable-length source code {(E", D")} on 
7^®" such that 

lim — logPfJ-(log|n„| +logdim7^^J > r] = inf D{h\\a), (5) 



n " '''^ {n ' ■ " ■"' J h:H{h)>R 

where a is a probability distribution consisting of the eigenvalues of 'Pp, and b denotes 
another probability distribution. D{h\\a) is relative entropy. 

Theorem 2 If a sequence {(E",D")} of quantum variable-length source codes on 'H®^ is 
universal, then 

limsup — logPf„|-(log|fiJ+logdim7i:^J >i?l < inf D{h\\a) (6) 

n yn J h:H{h)>R 

Therefore, the RHS of (Bl) is the optimal exponent of the overflow probability. 



4 Construction of a quantum variable- length source 
code 

First, for an intuitive explanation of our construction, we naively construct a good 
variable-length code. For this construction, we fixed a strictly increasing sequence a : = 
{o^ijiii of real numbers such that = ai < ^2 < • • • < < = logc?. We define the 
encoder E"'" with the data set {!,...,/} by 

p"." p _ p 

Ef'"(p„) := PrPnPr. Pn e 5(7^^"), 

and define the decoder D"'" as the embedding to 7i®". Assume that H{j)p) belongs to 
the interval [a,, aj+i). As is guaranteed by if H(j)p) does not lie on the boundary on 
the open interval the probability Trpp"P"'" tends to 1. Thus, we can prove 

e„,p(E"'", D"'") — > 0. Of course, if we choose Oj+i — ai to be sufficiently small, the coding 
length is close to the entropy H(j)p) with almost probabihty 1. However, if H(j)p) lies on 
the boundary, i.e. H(j)p) = a^, the state is demolished, as is caused by the same reason 
of Lemma 2 in P|. In this case, we can prove lime,i_p(E"'", D"'") > 0. Thus, it is not 
universal. 

For the non-demolition of initial states, we construct a variable-length code, by choos- 
ing the integer k such that < k < 6n at random where 6 := logd/{l — 1). Depending 
on the integer k, we define d{k/n) = {a{k/n),i}''^\ as 

a{k/n)i = k/n + {i - 2)6 if i = 2,... 1. 

a{k/n)i = 0, a{k/n)i^i = logd, 



and use the encoder E""*^^/""^'" and the decoder D"*-'^/"-''". In this protocol, the probabihty 
that H(j)p) lies on the boundary goes to zero. Therefore, the above code seems a quantum 
universal variable-length code. 

In the following, we give a mathematical definition the above code, evaluate its per- 
formance, and prove the optimality of its optimality of the exponent of the overflow 
probability by choosing 6 (or /) depending on n. We define the data set and the encoder 
E-^'" as 

■■= {keZ\0 <k< Sn} x {1, 2, . . . , /}, 
and the decoder D^'" as the embedding, i.e., we perform the measurement 



[n6] 



pd(k /n) ,n 



k,i 



in the encoding process, where [x] is Gauss notation i.e., [x] is the maximum integer n 
satisfying n < x. 

Its performance was evaluated as follows: 

e.,(E^-, D^-) < 1 - i^^^^ppl (l - (n + exp(-nC5'^' ^ 
Pfp" (- (log \^n\ + logdim7^fc,i) > i?l < (n + dfe^v \ -n min D{h\\ai) 



(7) 



^(b||a) 



M((iog<i)/* + i)^ 



where 5' is arbitrary real number satisfying < 25' < 6. The above inequalities are 
proven in Appendix. When we choose S and 5' as 5„ := and 5'^ := 

f{n,6n)/n goes to 0. Thus, we obtain 



e„,p(E^"'",D'5'")^0 (10) 



"1 - _^17H5,T1 [ 1 



liminf — logP^„ <^-(log|fi„|+logdim7^fc,,) >i?^ > inf D(b||a), (11) 

n '-^ I n J b:H{h)>R 



which imply Theorem 0. 



5 Compression of entangled states 

Next, we consider another compression problem in which we compress an entangled state 
by local operations. We apply our protocol given in section 4 to this problem. Assume 
that we share an entangled state which is the tensor product of a pure state 



on the composite system T-Ca^'Hb- We want to save the dimension of the quantum system 
of Ahce, in the situation where only local operations of Alice's system are allowed. In a 
fixed-length compression, our operation is described by a triplet (7i„, E^, D^) consisting 
of a subspace Tin of 7i®", an encoder, i.e., a TP-CP map from Tif^ to 7i„, and a decoder 



i.e., a TP-CP map from Tin to Ti 



A ■ 



Its performance is characterized by the coding 



length logdimT^^ and the Bures' distance fo^d^) /^'^ o ® where 
Ib is the identity operator on Tis- Now, we define the following value as the bound of 
the asymptotic performance: 



R\<t>){<t>\ := inf <^ limsup — log dim H„ 



3{(H„,E^D")}, 



then we can prove -R|0)(0| = -ff(TrB as follows. If i? > //(Tr^ then 

Tr(PR,„ (8) = TrAP/?,„TrB ^ 1. Therefore, the encoder and 

the decoder defined in section 2 satisfies o ® (01®")) ^ 0. 

Conversely, we assume that a sequence i?", D")} satisfies 



62(|0)(0|®",D"o£;"®/|"(|0)(0|®")) ^ 0. 



(12) 



The entanglement of formation of the compressed state J^"(|0)(0|®")) is less than 

logdim?i„, and the entanglement of formation of the original state £'j-(|0)(0|®") equals 
nH {Ti B {<p\) ■ (Concerning the entanglement of formation, please see, for example, 
Hayden et.al.[jn[.) Thus, similarly to Hayden et.al.|T^, by using (|T2|) and the continuity 



and the monotonicity of the entanglement of formation, we can prove 

limsup - log dim > limsup -E.(E" ® /r(l0)(0r")) 
n n 

> limsup -E^(D" o ® /r(|0)(0|®"))) = limsup -Ey(|0) (0|®") = //(Tr^ |0)(0|). 



n 



n 



However, the above protocol cannot be used when the entropy rate H(Tyb is 
unknown. In the following, we consider the case where we share a tensor product state 
p^n ig f^i^Q tensor product of a general state p on the composite system TiA ®TiB- 

We apply our protocol given in section 4 to the case where the entropy rate H{Ttb p) is 
unknown. In this situation, the coding length is variable, the performance is characterized 
by the distribution of the coding length P^^" and the average of error e„^p(E'''", D^'"), 
which are defined by 



= TrE:jJpn 
e.,,(E",D") ■.= Y.vZMTTKSPnh' 



TrE" (p^") 



As a general setting which unifies the above setting and the setting given in section 3, we 
consider the setting where a general state on the composite system TiA ®TiB is generated 
with the probability p{p). In the i.i.d. extended setting, a state Pn '■= Pi ® P2 ® ■ ■ ■ ® Pn 
on the tensor product system TiJ^ Ti%^ is generated with the i.i.d. probability p". 
In this setting, the probability of the coding length P^p" and the average of the error 
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en,p{E^''', D"^'") are defined by 



Pn 



e„,(E", D") := $:i>"(p„) K) Tr E::„(p„)6^ (p„, B^^ « /- {^^^) 

Since the case dimT-^^ = 1 is equivalent to the setting in section 3 and 4, this setting 
is a generahzation of not only the above setting but also the setting in section 3 and 4. 
Moreover, the encoder E*^'" and decoder D*^'" proposed in section 4 satisfies 



e„ JE-^-", D^'") < 1 - ^""^^ '^^'^^ (l-in + d)'" expi-nC5'' 

\no\ V 



(13) 

- (\og\Qn\ +^ogdimHki) > r\ < (n + dY'^exp ( -n min -Dfbllc 

n : / J \ / ^ Hib)>R-f{n,S)/n 

(14) 

where a is the probability distribution consists of the eigenvalues of Pp_^ := Tr^p^ and 
6' is arbitrary real number satisfying < 26' < 6. These inequalities are proven in 
Appendix, Thus, we obtain two equations similar to (p^Of ) and (0). 



6 Discussion 

We construct a quantum universal variable-length code for i.i.d. sources from a quantum 
universal fixed-length code for i.i.d. sources. This construction clarifies the relation 
between the above types codes. We can expect a similar relation in a more general 
setting, which is a future problem. 
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First, we prove inequalities (|^) and (0). We can evaluate the average error as 
6„,p(E^'«,D^'") 



Pn 



k,i 



p 



d(k/n),n j^n 



IT] P. 



( 



X 



1 - Tr 



\ 



y \nS\ ( 


pa{k/n),n ^ 


^T)pn^J^ (/ 


■,a{k/n),n ^ j 




Tr ^ 




) pn 



X Tr 



\ 



[n6] 



p: 



a{k/n),n 



Pn] 



[n6] 



p: 



.d(k/n),n ^ j(^n 



i-Ep|Ep"W.)(T''(^' 



kA 



yd(k/n),n j^r 



B / yn 



k,i 



TjPn 



1- fTrp®" (p. 
4- [n6] [ [ ^ 



yd{k/n),n j(^n 



-r^n pd{k/n),n\ 2 



(15) 



(16) 



where inequality (15) follows from Jensen's inequality concerning the convex function 
X 1-^ x^/^. Note that in the case of inequality (0), dimHs = 1 and p®^ = p®". The 
number of the pair {k,i) satisfying |i^(a) — a{k/n)i — || < | — 5' is [n{5 — 25')] or 
[n{5 — 25')] + 1. When the pair {k,i) satisfies this condition, a{k/n)i < H{a) — 5' and 
a{k/n)i^i > H{a) + 5' . Therefore, using (^ and (^) we obtain 



> 1 - (n + c/)^'^exp(-nC(5''). 



Pp,A^i 

Inequalities (|^ and (0) follow from (|I^) and 

Next, we prove (|]) and ([l3|) . Inequality (H) guarantees that 

dimUu, < rankP,(fc/„),^„„ < (n + rf)4'^e""('=/")»+^ 

For any /c, we let ik be the minimum integer satisfying 



(17) 



- (log \VLn \ + logdim7Yfc,i) > -R, 
n 



i.e., dimHfc,i > e^^/{[n5]l). Since / = (logrf)/5 + 1, 



a{k/n)i^ >R + f{n,6)/n. 



(19) 



o 



From (H) and (0), we obtain the inequality 



Thus, 



^^^«l./n),n^j„ = TrP„(,/„)^^.,„pJ:^ <{n + d^exp ( -n min Z^(b||a) 

H(a)>H+f{n,o)/n 



" 



A; j:1l8h 



pTa <{n + df^e^^ -n min /^(blla^ 



which imphes and 

Finally, we prove (|). Since PR,npnPR,n + Tr[(/ - P/?,„)p„;j^^ > PR,npnPR,n and the 



function x i-^ -^/x is operator monotone, we obtain y PR,nPnPR,n + Tr[(/ — PR^n)Pn\ t^Pr 
\/PR,nPnPR,n- Therefore, 



> 



T^P^iPn 



1 - Tr 



PR,nPnPR,n + Tr[(/ - Pr,,„)p„] 



R,n 



TrP, 



<^P''{Pn) 1-(Tt ^/ PR,nPnPR,n\/K 



Pn 



Pn 



1 - Tr -y/ \/ PnPR,nPnPR,n\/ Pn 



= '^P^{pn) 1 - (^Tr \fJ'nPR,n \fpy 

Pn 
Pn 

<I]p"(Pn)2 [1-Tr p„P^,„] 



(20) 



Pn 



=2 1 - Tr 



X^P"(Pn)Pn^i?,n 



Pn 



2(l-Trp^«'^P^,„) 



where inequality (^) follows from the inequality 1 — < 2(1 — x). Therefore, using 
inequality (|]), we obtain (|^). 
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